Applying Statistical Methods to Machine Translation

Author

  • Peter E. Brown
Abstract

A common paradigm in machine translation is analysis, transfer, and synthesis. In French-to-English translation, for example, a French sentence is analyzed into an intermediate structure in which various ambiguities present in the surface form have been resolved. This structure is then transferred to a similar English structure. Finally, an English sentence is synthesized from the intermediate English structure. Analysis, transfer, and synthesis each require considerable linguistic insight for their successful dispatch.

Similar resources

A new model for Persian multi-part words edition based on statistical machine translation

In English, multi-part words are hyphenated, with the hyphen separating the parts. Persian has multi-part words as well. Under Persian morphology, a half-space character is needed to separate the parts of a multi-part word, but in many cases people incorrectly use a space character instead. This common incorrect use of the space leads to some s...

Optimization Strategies for Online Large-Margin Learning in Machine Translation

The introduction of large-margin based discriminative methods for optimizing statistical machine translation systems in recent years has allowed exploration into many new types of features for the translation process. By removing the limitation on the number of parameters which can be optimized, these methods have allowed integrating millions of sparse features. However, these methods have not ...

Contextual Modeling for Meeting Translation Using Unsupervised Word Sense Disambiguation

In this paper we investigate the challenges of applying statistical machine translation to meeting conversations, with a particular view towards analyzing the importance of modeling contextual factors such as the larger discourse context and topic/domain information on translation performance. We describe the collection of a small corpus of parallel meeting data, the development of a statistica...

Experiments with POS-based restructuring and alignment-based reordering for statistical machine translation

This paper presents the methods which are based on the part-of-speech (POS) and auto alignment information to improve the quality of machine translation result and the word alignment. We utilize different types of POS tag to restructure source sentences and use an alignment-based reordering method to improve the alignment. After applying the reordering method, we use two phrase tables in the de...

Using Example-Based MT to Support Statistical MT when Translating Homogeneous Data in a Resource-Poor Setting

In this paper, we address the issue of applying example-based machine translation (EBMT) methods to overcome some of the difficulties encountered with statistical machine translation (SMT) techniques. We adopt two different EBMT approaches and present an approach to augment output quality by strategically combining both EBMT approaches with the SMT system to handle issues arising from the use o...

EUSMT: Incorporating Linguistic information to Statistical Machine Translation for a morphologically rich language. Its use in preliminary SMT-RBMT-EBMT hybridization

We have proposed and successfully tested new techniques to deal with the problems found in applying Statistical Machine Translation (SMT) to language pairs with great morphological and syntactical differences. These techniques are based on segmentation and reordering and we have evaluated them in the context of Spanish-Basque translation. Dealing with morphology, we first proved that the qualit...

Publication date: 1993